(54) Title: DETERMINING CANCER-LINKED GENES AND THERAPEUTIC TARGETS USING MOLECULAR CYTOGENETIC METHODS

(57) Abstract: Methods for identifying potential therapeutic agents, such as anti-tumor agents, based on their modulation of the 
expression of specified genes, especially genes mapping to specific chromosomal regions, are disclosed. Also described are methods 
for diagnosing cancerous, or potentially cancerous, conditions as a result of the expression, or patterns of expression, of such genes, 
including detecting changes in levels of gene copy number and/or level of amplification of the said gene, or sets of genes, to detect 
and/or diagnose the cancer. Methods for detecting or determining functionally related genes, as well as methods for treating cancer 
based on targeting expression products of such genes, determining genes involved in the cancerous process and the success and/or 
response rates and survival statistics for cancer patients on treatment are encompassed by the invention. Also encompassed are 
methods involving determining the modulated expression of the genes in these regions of interest (ROIs) as pharmacodynamic/phar- 
macogenetic/surrogate markers and/or for patient profiling prior to accrual for clinical trials/treatments based on the identification 
of these genes as validated gene/drug targets in various cancer tissue types.

DETERMINING CANCER-LINKED GENES AND
THERAPEUTIC TARGETS USING MOLECULAR 
CYTOGENETIC METHODS 

This application claims priority of U.S. Provisional Application Serial
No. 60/462,895, filed 15 April 2003, the disclosure of which is hereby
incorporated by reference in its entirety. 


FIELD OF THE INVENTION 

The present invention relates to identification of genes whose
disruption and/or change in expression is useful to distinguish cancerous from
non-cancerous tissue and which serve as potential therapeutic targets and as
pharmacodynamic/pharmacogenetic/surrogate, prognostic and diagnostic
markers. These genes are identified by high resolution Comparative
Genomic Hybridization (CGH) and Spectral Karyotyping (SKY)/fluorescent in
situ hybridization (FISH) analysis of DNA and chromosomes of various cancer
cell lines and primary and metastatic tumor samples, combined with gene
expression analysis of these cells and tissues.


BACKGROUND OF THE INVENTION 

Chromosomal abnormalities have been identified in most cancer cells. 
Conventional chromosome banding techniques allow for the detection of 
specific chromosomal defects in tumor cells but interpretation of the banding
pattern is sometimes difficult, particularly when complex chromosomal
rearrangements or subtle abnormalities are present. In recent years, new
techniques, such as CGH and SKY, based on fluorescent in situ hybridization
(FISH) (Pinkel et al., Proc Nat Acad Sci USA 85:9138-42 (1988)) have been
developed to overcome the limitations of conventional chromosome banding.
CGH measures intensities of fluorescently labeled tumor DNA and normal
DNA following hybridization to normal chromosomes (Kallioniemi et al.,
Science 258:818-21 (1992)). Gain or loss of copy number of a particular
chromosome or chromosome region in the tumor DNA is determined by the
ratio of the tumor to normal fluorescence intensities. SKY utilizes a cocktail of
chromosome probes, fluorescently labeled to specify each chromosome,
which is hybridized to tumor chromosomes in an effort to identify numerical
and structural abnormalities in the tumor cell (Schrock et al., Science
273:494-7 (1996)). CGH and SKY have been used to identify chromosomal
regions that harbor genes significant to the process of tumor initiation or
progression.
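
As a rough illustration of how a CGH fluorescence ratio can be read, the following Python sketch classifies a chromosomal region as gained, lost or balanced from its tumor and normal intensities. The function name and the thresholds (1.25 and 0.75) are illustrative assumptions and are not values taken from this disclosure.

```python
def classify_cgh_region(tumor_intensity, normal_intensity,
                        gain_threshold=1.25, loss_threshold=0.75):
    """Classify a region from the tumor/normal fluorescence ratio.

    The default thresholds are hypothetical and chosen only for illustration.
    """
    if normal_intensity <= 0:
        raise ValueError("normal_intensity must be positive")
    ratio = tumor_intensity / normal_intensity
    if ratio >= gain_threshold:
        return "gain"      # tumor signal dominates: copy number gain
    if ratio <= loss_threshold:
        return "loss"      # normal signal dominates: copy number loss
    return "balanced"      # ratio close to 1: no detectable change
```

In practice the cut-offs would be calibrated against control hybridizations; the sketch only shows the direction of the comparison.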

BRIEF SUMMARY OF THE INVENTION 

In one aspect the present invention relates to a set of genes that have
been localized within human chromosomal regions of interest (ROI) that have
been identified by molecular cytogenetic techniques.

In one aspect, the present invention relates to a method for diagnosing 
cancer in a mammal, especially a human patient, comprising determining
amplification of a gene in the genome of a mammal wherein said gene is a 
gene of Table 1. 

In a preferred embodiment thereof, the cancer is a member selected 
from breast cancer, colon cancer, lung cancer, prostate cancer, ovarian
cancer, pancreatic cancer, cervical cancer and kidney cancer.

In another preferred embodiment thereof, said gene of Table 1 is a gene
that encodes the same gene product as a polynucleotide selected from the
polynucleotides of SEQ ID NO: 1 - 805 and 855 - 923.


In another embodiment, the present invention relates to a method for 
diagnosing cancer or a pre-cancerous condition in a mammal, comprising: 

(a) obtaining a cell or tissue sample from a mammal, especially a
human patient, suspected of having cancer or a pre-cancerous condition and
determining for said sample the gene copy number of a gene of Table 1;

(b) comparing said gene copy number of step (a) to the gene copy
number of the same gene from a sample of a corresponding cell or tissue
from a mammal of the same species not having cancer of the type being
diagnosed;

whereby a higher gene copy number determined in step (a) relative
to that in step (b) indicates the presence of a cancer or pre-cancerous
condition in the mammal of step (a) and results in a diagnosis of cancer or a
pre-cancerous condition in said mammal.
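
The comparison in steps (a) and (b) can be sketched in Python as follows. This is a minimal illustration that assumes copy numbers are already available as per-gene values; the function name and the gene labels used in the usage note are hypothetical placeholders, not entries of Table 1.

```python
def genes_with_elevated_copy_number(sample_counts, reference_counts):
    """Return genes whose copy number in the test sample exceeds the reference.

    sample_counts:    gene -> copy number in the suspect sample (step (a))
    reference_counts: gene -> copy number in the corresponding non-cancerous
                      sample (step (b))
    """
    elevated = []
    for gene, sample_copies in sample_counts.items():
        reference_copies = reference_counts.get(gene)
        if reference_copies is None:
            continue  # no reference measurement, so no comparison is possible
        if sample_copies > reference_copies:
            elevated.append(gene)
    return sorted(elevated)
```

For example, with a sample of `{"GENE_A": 6, "GENE_B": 2}` and a reference of `{"GENE_A": 2, "GENE_B": 2}`, only the placeholder `GENE_A` is reported as elevated.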

In a preferred embodiment of the methods of the invention, said
molecule is a member selected from an antisense DNA, an antisense RNA, a
ribozyme and an siRNA.

In another embodiment, the present invention relates to a method for
identifying an agent having therapeutic activity in a human patient in need of
said therapeutic activity, comprising:

(a) determining in a sample from a patient the level of a gene product
encoded by a gene of Table 1 prior to administering a test compound to said
patient;

(b) administering said test compound to said patient;

(c) determining in a sample from said patient the level of a gene
product encoded by the same gene as in step (a);

wherein a decrease in the level of said gene product in step (c) relative
to step (a) identifies said test compound as an agent having therapeutic
activity.

In a further embodiment, the present invention relates to a method for
identifying an antineoplastic agent, comprising:

(a) contacting a test compound with a cell that expresses a gene of
Table 1; and

(b) determining a change in gene expression as a result of said
contacting;

whereby said change in said gene expression identifies said test
compound as an antineoplastic agent.

The present invention also relates to a method for determining the
cancerous status of a cell, comprising determining elevated expression in said
cell of a gene of Table 1 wherein elevated expression of said gene indicates
that said cell is cancerous.

In an additional embodiment, the present invention relates to a method
for identifying a compound as an antineoplastic agent, comprising:

(a) contacting a test compound with a polypeptide encoded by a gene
of Table 1,

(b) determining a change in a biological activity of said polypeptide due
to said contacting,

wherein a change in activity identifies said test compound as an agent
having antineoplastic activity.

In a preferred embodiment of the foregoing, the polypeptide is an
enzyme selected from kinase, protease, peptidase, phosphodiesterase,
phosphatase, dehydrogenase, reductase, carboxylase, transferase,
deacetylase and polymerase.

The present invention also relates to a method for identifying an
antineoplastic agent comprising contacting a cancerous cell with a compound
found to have antineoplastic activity in the other methods of the invention
under conditions promoting the growth of said cell and detecting a change in
the activity of said cancerous cell.

The present invention further relates to a method for treating cancer 
comprising contacting a cancerous cell with an agent having affinity for an 
expression product of a gene of Table 1 and in an amount effective to cause a 
reduction in cancerous activity of said cell.

The present invention also contemplates a method for monitoring the 
progress of cancer therapy in a patient comprising monitoring in a patient 
undergoing cancer therapy the expression of a gene of Table 1.

In addition, the present invention encompasses a method for 
determining the likelihood of success of cancer therapy in a patient, 
comprising monitoring in a patient undergoing cancer therapy the expression 
of a gene of Table 1 wherein a decrease in said expression prior to 
completion of said cancer therapy is indicative of a likelihood of success of
said cancer therapy. 

In another embodiment, the present invention relates to a method for
producing test data with respect to the anti-neoplastic activity of a compound
comprising:

(a) identifying a test compound as having anti-neoplastic activity using
other methods of the invention;

(b) producing test data with respect to the anti-neoplastic activity of
said test compound sufficient to identify the chemical structure of said test
compound.

Additionally, the present invention encompasses a method for determining
the progress of a treatment for cancer in a patient afflicted therewith, following
commencement of a cancer treatment on said patient, comprising:

(a) determining in said patient a change in expression of one or more
genes of Table 1; and

(b) determining a change in expression of said gene compared to
expression of said one or more determined genes prior to said cancer
treatment;

wherein said change in expression indicates progress of said treatment,
thereby determining the progress of said treatment.



SEQUENCE LISTING ON CD-ROM ONLY 

The sequences disclosed herein as SEQ ID NO: 1-923 in the sequence
listing are contained on compact disc (CD-ROM) only, which accompanies
this application and the contents of said CD-ROMs are hereby incorporated
by reference in their entirety. These sequence numbers also appear in Table
1 where all sequences are referred to as consecutive serial numbers for
reference purposes only.



DETAILED SUMMARY OF THE INVENTION

The present invention relates to a set of genes that are amplified
and/or over-expressed in cancer cell lines and have been localized to
various chromosomal regions of interest. These genes have been identified
through a combination of CGH, SKY, expression analysis and Reverse
Transcriptase-Polymerase Chain Reaction (RT-PCR). Such genes are both
markers and potential therapeutic targets for cancer, in particular breast,
colon, lung and prostate malignancies. In addition, the amplified nature of
such genes provides a means of diagnosing a cancerous condition, or
predisposition to a cancerous condition, by determining the amplification of
one or more of such genes in a patient afflicted with, or predisposed toward,
or otherwise at risk of developing, cancer.

In accordance with the present invention, a number of genes have
been localized to chromosomal regions of interest as identified in Table 1
(serial number 1-229 (breast), 230-440 (colon), 441-656 (lung) and 657-805
(prostate), serial number 806-923 (transcript or protein)). The invention also
includes any subsets of these. As described herein, these sequences include
DNA sequences of SEQ ID NO: 1 - 805, transcripts with the sequences of
SEQ ID NO: 855 - 923, and proteins/polypeptides with amino acid sequences
of SEQ ID NO: 806 - 854.
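
The serial number ranges stated above can be captured in a small lookup. The following Python sketch encodes only the ranges given in this paragraph; the function and constant names are illustrative assumptions.

```python
# (first serial number, last serial number, sequence type, tissue or None)
SERIAL_RANGES = [
    (1, 229, "DNA", "breast"),
    (230, 440, "DNA", "colon"),
    (441, 656, "DNA", "lung"),
    (657, 805, "DNA", "prostate"),
    (806, 854, "polypeptide", None),
    (855, 923, "transcript", None),
]


def describe_seq_id(seq_id):
    """Return (sequence type, tissue) for a SEQ ID NO between 1 and 923."""
    for first, last, seq_type, tissue in SERIAL_RANGES:
        if first <= seq_id <= last:
            return seq_type, tissue
    raise ValueError(f"SEQ ID NO {seq_id} is outside the range 1-923")
```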


Briefly, the procedures used to identify the genes disclosed herein may 
be summarized as follows: 

For CGH analysis, based on detailed molecular cytogenetic
characterizations, the following data sets are generated, which may include
regions reported in the public domain as well as unique regions not previously
known.

1. A map of chromosomal regions involved in consistent, recurrent and
high level genomic gains (i.e., amplifications) for a representative
cancer cell line or tumor type (e.g., colon, prostate, breast and lung)
that can be recognized as a pattern/signature for a given tumor type.

2. A map of chromosomal regions containing genomic losses (i.e.,
deletions) in each tumor type and individual cell line to be examined.

3. Levels of intensities of gains and losses categorized for entry into a
database.

4. A comparison of the patterns of gains and losses between the clinical
samples (e.g., colon xenografts) and cell lines (e.g., colon) of matched
stages and grades.

5. A comparison of the patterns of gains and losses between primary
prostate tumor cell lines (e.g., CPDR lines) and metastatic prostate
tumor cell lines (e.g., DU 145, PC3 and LNCaP).

In accordance with the present invention, for SKY analysis, data sets
were generated according to the following steps:

1. Identification and development of a database of novel chromosomal
rearrangements in epithelial cancer cell lines.

2. Identification of novel translocations involving specific chromosomes or
chromosomal regions.

3. Reconciliation of SKY and CGH analysis on the same cell line as a
verification of the combined findings.

Combining genomic DNA analysis of gains and losses in the tumor cell
lines/clinical samples with cDNA expression analysis from matched tumor
types displayed on a genome template from the Golden Path genome
browser using a Spotfire™ analysis tool:

1. A pattern of gene expression on a U-95 Affymetrix chip set obtained via
the Gene Logic database was used to generate differential gene
expression profiles between sample sets containing normal and
malignant tissues from colon, prostate, lung, breast and various cell
lines.

2. A Spotfire™ visualization tool was developed that allowed the
generation of a list of all the genes that are present in the Golden Path
within the clustered regions of gains/losses for each cell type/tumor
type to generate the gene sets to include in the HITS platform.

3. The following algorithm was employed:

i) Match chromosomal regions of amplification/gains defined by
CGH with the location of genes/ESTs on an Affymetrix chip
as mapped to a Golden Path genome template.

ii) Identify genes/ESTs over-expressed in tumor tissue
compared to normal tissue in said chromosomal regions
using the Gene Logic database.

iii) Compile data on cell lines of a particular tumor type and
different tumor types showing clusters of genomic gains and
losses at certain chromosomal regions.

iv) Pick BACs that span the chromosomal regions consistently
gained and containing over-expressed genes in an effort to
positionally clone novel cancer genes (oncogenes and genes
in relevant pathways).

v) Validate the identified genes by

A) Picking STS markers that identify the gene sequence and
quantify the relative copy number in genomic DNA and RNA
across a panel of tumor cell lines.

B) Developing probes for FISH on chromosomes from tumor cell
lines and primary tumor tissue micro-arrays.

4. The expression data from tumor cell lines that have undergone 
SKY/CGH analysis was used to pick candidate genes to validate as 
individual targets in functional genomic assays and in-vivo assays and for 
use in the transcriptional assay platform. 
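
Steps i) and ii) of the algorithm above amount to an interval overlap followed by an expression filter. The Python sketch below illustrates that idea under stated assumptions: the `Region` and `Gene` record layouts, the function names and the two-fold expression cut-off are hypothetical and are not taken from the Spotfire™ tool, the Gene Logic database or this disclosure.

```python
from typing import NamedTuple


class Region(NamedTuple):
    """A chromosomal region of gain defined by CGH (hypothetical layout)."""
    chromosome: str
    start: int
    end: int


class Gene(NamedTuple):
    """A gene/EST mapped to the genome template, with expression values."""
    name: str
    chromosome: str
    start: int
    end: int
    tumor_expression: float
    normal_expression: float


def overlaps(gene, region):
    """True when the gene's interval intersects the region on the same chromosome."""
    return (gene.chromosome == region.chromosome
            and gene.start <= region.end
            and gene.end >= region.start)


def candidate_genes(genes, gained_regions, min_fold_change=2.0):
    """Step i): keep genes inside gained regions.
    Step ii): keep those over-expressed in tumor relative to normal tissue."""
    candidates = []
    for gene in genes:
        if not any(overlaps(gene, region) for region in gained_regions):
            continue
        if gene.normal_expression <= 0:
            continue  # fold change is undefined without a normal baseline
        if gene.tumor_expression / gene.normal_expression >= min_fold_change:
            candidates.append(gene.name)
    return candidates
```

Genes returned by `candidate_genes` would then go forward to the BAC selection and validation of steps iv) and v); those steps are laboratory procedures and are not modeled here.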



In accordance with the present invention, over-expression of cellular
genes is conveniently monitored in model cellular systems using cell lines
(such as is used in the example below), primary cells, or tissue samples
maintained in growth media. For different purposes, these may be treated with
compounds at one or more different concentrations to assay for modulating
agents. Thus, cellular RNAs were isolated from the cells or cultures as an
indicator of selected gene expression. The cellular RNAs were then divided
and subjected to analysis that detected the presence and/or quantity of
specific RNA transcripts, which transcripts were then amplified for detection
purposes using standard methodologies, such as reverse transcriptase
polymerase chain reaction (RT-PCR). The levels of specific RNA transcripts,
including their presence or absence, were determined. When used for
identification of modulating agents, such as antineoplastic agents, a metric is
derived for the type and degree of response of the treated sample compared
to control samples.
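
The disclosure does not specify the metric. One common choice, shown here purely as an assumption, is the log2 ratio of the transcript level in the treated sample to that in the control: the sign gives the type of response (decrease or increase) and the magnitude gives its degree.

```python
import math


def response_metric(treated_level, control_level):
    """Log2 fold change of a transcript level, treated versus control.

    Negative values indicate reduced expression after treatment; positive
    values indicate increased expression. This metric is an illustrative
    assumption, not the metric of the disclosure.
    """
    if treated_level <= 0 or control_level <= 0:
        raise ValueError("transcript levels must be positive")
    return math.log2(treated_level / control_level)
```

A value of -1 corresponds to a halving of the transcript level relative to control.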

In accordance with the foregoing, the genes identified as being
amplified and/or over-expressed, which can include increased copy number
thereof, in cancerous cells are localized in chromosomal regions of interest as
identified in Table 1 (serial number 1-229 (breast), 230-440 (colon), 441-656
(lung) and 657-805 (prostate); for polypeptide SEQ ID NOs, see Table 1,
serial number 806-923 (transcript or protein)).

These genes may be utilized to characterize the cancerous, or
non-cancerous, status of cells or tissues. The methods of the invention may be
used with a variety of cell lines or with primary samples from tumors
maintained in vitro under suitable culture conditions for varying periods of
time, or in situ in suitable animal models.

The genes disclosed herein are expressed at levels in cancer cells
that are different from the expression levels in non-cancer cells. These
genes as identified in Table 1 are amplified in cancer cells relative to
non-cancer cells of corresponding tissues, especially breast cancer, colon
cancer, lung cancer, prostate cancer, ovarian cancer, pancreatic cancer,
cervical cancer and kidney cancer.

In accordance with the foregoing, the present invention relates to a
method for diagnosing cancer in a mammal, comprising determining 
amplification of a gene in the genome of a mammal wherein said gene is a 
gene of Table 1. 


In a preferred embodiment thereof, said gene of Table 1 is a gene 
that encodes the same gene product as a polynucleotide selected from 
the polynucleotides of SEQ ID NO: 1 - 805 and 855 - 923. In a further 
preferred embodiment, said mammal is a human patient.

The present invention is also directed to a method for diagnosing 
cancer or a pre-cancerous condition in a mammal, preferably a human 
patient, comprising: 

(a) obtaining a cell or tissue sample from a mammal suspected of
having cancer or a pre-cancerous condition and determining for said sample
the gene copy number of a gene of Table 1;

(b) comparing said gene copy number of step (a) to the gene copy
number of the same gene from a sample of a corresponding cell or tissue
from a mammal of the same species not having cancer of the type being
diagnosed;

whereby a higher gene copy number determined in step (a) relative 
to that in step (b) indicates the presence of a cancer or pre-cancerous 
condition in the mammal of step (a) and results in a diagnosis of cancer or a 
pre-cancerous condition in said mammal. 


In specific embodiments, the cancer to be diagnosed is one or more of 
breast cancer, colon cancer, lung cancer, prostate cancer, ovarian cancer, 
pancreatic cancer, cervical cancer and kidney cancer. 

Preferably, the gene of Table 1 is a gene that encodes the same gene
product as a polynucleotide of SEQ ID NO: 1 - 805 and 855 - 923.

The present invention is also directed to a method of inhibiting cancer,
or a precancerous condition, in a mammalian cell, comprising contacting said
cell with a molecule that inhibits function of a gene of Table 1. Preferably, the
gene of Table 1 is a gene that encodes the same gene product as a
polynucleotide of SEQ ID NO: 1 - 805 and 855 - 923. In a specific
embodiment thereof, said molecule inhibits gene function by binding to said
gene. In other embodiments, the molecule inhibits gene function by binding to
an RNA encoded by said gene or inhibits gene function by binding to a
polypeptide encoded by said gene. Preferably, the molecule is a member
selected from an antisense DNA, an antisense RNA, a ribozyme and an
siRNA. Also preferred is where the cancer is a member selected from breast
cancer, colon cancer, lung cancer, prostate cancer, ovarian cancer,
pancreatic cancer, cervical cancer and kidney cancer.

The invention contemplates that such contacting occurs in vivo.

The invention also relates to a method for identifying an agent having 
therapeutic activity in a human patient in need of said therapeutic activity, 
comprising: 

(a) determining in a sample from a patient the level of a gene product
encoded by a gene of Table 1 prior to administering a test compound to said
patient;

(b) administering said test compound to said patient;

(c) determining in a sample from said patient the level of a gene
product encoded by the same gene as in step (a);

wherein a decrease in the level of said gene product in step (c) relative 
to step (a) identifies said test compound as an agent having therapeutic 
activity. 

Preferably, said therapeutic activity is anticancer activity and said
cancer is one or more of breast cancer, colon cancer, lung cancer, prostate
cancer, ovarian cancer, pancreatic cancer, cervical cancer and kidney cancer.

Also preferred is where said gene product is an RNA or a polypeptide,
especially where an activity of the polypeptide is determined, preferably an
enzyme activity. In specific embodiments, said gene of Table 1 is a gene that
encodes the same gene product as a polynucleotide of SEQ ID NO: 1 - 805
and 855 - 923, as well as where said molecule is a member selected from an
antisense DNA, an antisense RNA, a ribozyme and an siRNA.

The present invention also relates to a method for identifying an 
antineoplastic agent, comprising: 
(a) contacting a test compound with a cell that expresses a gene of
Table 1; and

(b) determining a change in gene expression as a result of said
contacting;

whereby said change in said gene expression identifies said test
compound as an antineoplastic agent.

Most preferred is where the change in expression is a decrease in
expression. The contacting may occur in vivo. Also preferred is where
said gene of Table 1 encodes the same gene product as a polynucleotide of
SEQ ID NO: 1 - 805 and 855 - 923 and where said molecule is a member
selected from an antisense DNA, an antisense RNA, a ribozyme, an siRNA, a
small organic molecule and an antibody.

The present invention also relates to a method for determining the
cancerous status of a cell, comprising determining elevated expression in said
cell of a gene of Table 1 wherein elevated expression of said gene indicates
that said cell is cancerous. Preferably, said elevated expression is an
elevated copy number of the gene and said gene of Table 1 encodes
the same gene product as a polynucleotide of SEQ ID NO: 1 - 805 and 855 -
923.

The present invention further relates to a method for identifying a
compound as an antineoplastic agent, comprising:

(a) contacting a test compound with a polypeptide encoded by a gene
of Table 1,

(b) determining a change in a biological activity of said polypeptide due
to said contacting,

wherein a change in activity identifies said test compound as an agent
having antineoplastic activity.

Preferably, said gene of Table 1 encodes the same gene product as a
polynucleotide of SEQ ID NO: 1 - 805 and 855 - 923.

In a preferred embodiment, the change in biological activity is a 
decrease in biological activity. 
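
A decrease in biological activity is often reported as percent inhibition relative to an untreated control. The following Python sketch shows that calculation; the formula is a standard convention assumed here for illustration, and the function name is hypothetical.

```python
def percent_inhibition(activity_with_compound, activity_without_compound):
    """Percent decrease in measured activity caused by the test compound.

    Positive values mean the compound decreased the activity; negative
    values mean the activity increased.
    """
    if activity_without_compound <= 0:
        raise ValueError("control activity must be positive")
    return 100.0 * (1.0 - activity_with_compound / activity_without_compound)
```

For example, an activity of 25 units with the compound against 100 units without it corresponds to 75 percent inhibition.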


In another preferred embodiment, the biological activity is an enzyme 
activity, such as where the enzyme is one selected from the group kinase, 
protease, peptidase, phosphodiesterase, phosphatase, dehydrogenase, 
reductase, carboxylase, transferase, deacetylase and polymerase. 


Assays for these enzymes are available, such as for
phosphodiesterases (the most pharmacologically relevant
phosphodiesterases are those that hydrolyze cyclic nucleotides; see, for
example, cAMP and cGMP assays available from Perkin-Elmer, as well as
Estrade et al., Eur. J. Pharmacol. 352:2-3, 157-163 (1998)).

Protein phosphatases remove phosphate residues from proteins. 
Most tests of their activity use the same assays as for protein kinases. A 
non-radioactive phosphatase assay system is available from Promega
Biotech.

The therapeutically most relevant dehydrogenases are those that oxidize or
reduce small molecular weight metabolites, especially steroid hormones, or
that generally use or generate NAD or NADP (see: Haeseleer et al., J. Biol.
Chem., 273:21790-21799 (1998)). A commercial assay is available from
Cayman Chemical (at www.caymanchem.com).

Gamma-carboxylases are important enzymes in the blood 
coagulation process. The main assay protocols use synthetic peptides 
(see: Ulrich et al., J. Biol. Chem., 263:9697-9702 (1988); Begley et al., 
J. Biol. Chem., 275:36245-36249 (2000)).



In highly preferred embodiments, the kinase is one of a protein kinase,
a serine or threonine kinase, or a receptor tyrosine protein kinase. Where the
polypeptide encoded by a gene of the invention is a protein kinase, especially
a tyrosine kinase, various assays for activity are available. Protein
kinases add phosphate groups to serine, threonine or tyrosine residues on
proteins. Their activity is most commonly measured with phospho-serine-,
threonine- or tyrosine-specific antibodies, or by generation of radiolabeled
substrate, consumption of ATP, phosphorylation of (synthetic) small peptides,
or measurement of downstream enzyme activity and gene transcription. Such
assays are commercially available (see, for example, the tyrosine kinase
assay from Roche Molecular Biochemicals). Assays for serine/threonine
kinases are also available at Chromagen.com, Upstate Biotechnology,
Inc. (Lake Placid, NY, and at upstatebiotech.com) and from Applied
BioSystems (Foster City, CA (home.appliedbiosystems.com)).

In other specific embodiments, the protease is a serine protease,
cysteine protease or aspartic acid protease, or the transferase is a
methyltransferase, preferably a cytosine methyltransferase or an adenine
methyltransferase, or the deacetylase is a histone deacetylase, or the
carboxylase is a γ-carboxylase, or the peptidase is a zinc peptidase, or the
polymerase is a DNA polymerase or an RNA polymerase.

Proteases degrade proteins, non-specifically or at specific sites.
Almost all pharmacologically relevant ones have very narrowly defined
specific substrates, and their activity is most often measured by directly
measuring cleavage product or generation of (fluorescent) light after
cleavage of synthetic substrates. Assays are available for serine proteases
(Calbiochem, Palo Alto, CA, and see Berdichevsky et al., J. Virol.
Methods, 107:245-255 (2003)), for cysteine proteases (see: Schulz et al.,
Mol. Pathol., 51:222-24 (1998) and Selzer et al., PNAS,
96:11015-11022 (1999)), for aspartic acid proteases (Geno Tech, Inc. at
www.genotech.com) and for zinc peptidases (see Evans et al., J. Biol.
Chem., 278:23180-23186 (2003)).


Both (regulatory) DNA-methylases and (biosynthetic) protein
methylases are drug targets. (See: Jonassen and Clarke, J. Biol.
Chem., 275:12381-12387 (2000); Jackson et al., Nature, 416:556-560
(2002)).


Most HDAC (histone deacetylase) assays use colorimetric or
fluorometric (synthetic) substrates. Standard assays are for binding,
especially molecular size changes, blocking a specific site, and effects on
transcription or downstream reactions (if DNA or RNA is the direct target
of a drug). A commercial assay is available from Vinci Biochem (at
www.vincibiochem.it).

In another specific embodiment, the biological activity is a membrane
transport activity, preferably wherein the polypeptide is a cation channel
protein, an anion channel protein, a gated-ion channel protein or an ABC
transporter protein. Drug effects on the activity of transporter and channel
proteins are screened by measuring increase or decrease of the
(radio-labeled) transported entity inside or outside the cell, in cell-based assays,
ATP consumption (in the case of ATPases), or changes in cell membrane
potential. Assays employing such proteins are available, such as for ABC
transporter (see: Marcil et al., Lancet, 354:1341-46 (1999)) and for ion
channels (from Evotec OAI, at www.evotecoai.com and from
PharmaLinks, at www.pharmalinks.org/research/cellsignalling).

In one embodiment, the polypeptide is an integrin (the signal
transduction pathways elicited by the integrins are slow and not very well
characterized, hence most assays are either just binding assays or
measure downstream biological phenomena such as migration, invasion,
etc.; see: Ganta et al., Endocrinology, 138:3606-3612 (1997); Sim et
al., J. Biomed. Mater. Research, 68A:352-359 (2004); and Weinreb et
al., Anal. Biochem., 306:305-313 (2002)), or a Cytochrome P450 enzyme
(almost all cytochrome assays require knowledge of what the substrate is
and measure conversion of substrate (free or (radio-)labeled) or generation
of product; useful C14-labeled substrates are available from Amersham
Biosciences at www1.amershambiosciences.com), or a nuclear hormone
receptor (assays available from Discovers, Fremont, CA, such as an estrogen
assay; also see Rosen et al., Curr. Opin. Drug. Discov. Devel., 6:224-30
(2003)).

In one preferred embodiment, the biological activity is a receptor
activity, preferably where the receptor is a G-protein-coupled receptor
(GPCR).

GPCRs are transmembrane proteins that wind 7 times back and
forth through a cell's plasma membrane with a ligand binding site located
on the outside of the membrane surface of the cell and the effector site
being present inside the cell. These receptors bind GDP and GTP. In
response to ligand binding, GPCRs activate signal transduction pathways
which induce a number of assayable physiological changes, e.g., an
increase in intracellular calcium levels, cyclic-AMP, inositol phosphate
turnover, and downstream gene transcription (directly or via reporter
assays), along with other translocation assays available for measuring
GPCR activation when the polypeptide encoded by a gene of the invention
is a GPCR. Thus, such proteins work through a second messenger. The
result is activation of CREB, a transcription factor that stimulates the
production of gene products. One useful assay is the so-called
BRET2/arrestin assay, useful in screening for compounds that interact
with GPCRs. (See: Bertrand et al., J. Recept. Signal Transduct. Res.,
22:533-41 (Feb.-Nov. 2002)). In addition, numerous assays are
commercially available, such as the Transfluor Assay, available from
Norak Biosciences, Inc. (www.norakbio.com) or ArrayScan and
KineticScan, both from Cellomics, or assays from CyBio (Jena, Germany).

Assays useful with the invention are usually set up to screen for
agonists or antagonists after adding ligand, but effects on most of these
parameters can be measured whether or not the ligand for the receptor is
known. Such assays may involve radioligand-binding assays. Activation of
the majority of GPCRs by agonists leads to the interaction of beta-arrestin
(a protein that is involved in receptor desensitization and sequestration)
with the receptor, which is measurable by fluorescence energy transfer.

The disclosures of all journal articles, or other publications, referred to
herein are hereby incorporated by reference in their entirety.

In one embodiment, the polypeptide is in a solution or suspension and
contact with the test compound is by direct contact between the test
compound and the protein molecule. Alternatively, the polypeptide may be in
a cell and the test compound may have to diffuse into the cell in order to
contact the polypeptide. In an alternative embodiment, the test compound
may be contacted with a cell that contains or expresses the polypeptide but
the test compound may have no direct contact with the polypeptide. Instead,
the test compound may act to induce production and/or activity of a different
compound, such as an intracellular second messenger that serves to contact
the polypeptide and modulate or change the biological activity of this
polypeptide.

In accordance with the foregoing, the method of the present invention
includes cancer modulating agents that are themselves either polypeptides, or
small chemical entities, that affect the cancerous process, including initiation,
suppression or facilitation of tumor growth, either in vivo or ex vivo. Such
agents may also be antibodies that react with one or more polypeptides
encoded by genes as disclosed herein, preferably polypeptides comprising
any one of the amino acid sequences of SEQ ID NO: 806 - 854.

Because the genes disclosed herein are over-expressed and relate to
the cancerous condition of a cell, successful antineoplastic activity will
commonly be exhibited by agents that reduce the expression of said genes as
identified in Table 1. In one embodiment thereof, the change in expression is
a decrease in copy number of the gene or genes under study. In accordance
therewith, said change in gene copy number is conveniently determined by
detecting a change in expression of messenger RNA encoded by said gene
sequence. In another preferred embodiment, expression is determined for
more than one such gene, such as 2, 5, 10 or more of the genes.

Other methods useful in measuring a change in expression of the
genes disclosed herein include measuring a change in the amount or rate of
synthesis of a polypeptide encoded by said gene, preferably a decrease in
synthesis of said polypeptide. Most preferably, the polypeptide comprises an
amino acid sequence highly homologous to a sequence for genes as
identified in Table 1 (SEQ ID NO: 1 - 923).

The methods of the invention can thus be utilized to identify
anti-neoplastic agents useful in treatment of cancerous conditions. Such
activity can be further modified by first identifying such an agent using an
assay as already described and further contacting such agent with a
cancerous cell, followed by monitoring of the status of said cell, or cells.
A change in status indicative of successful antineoplastic activity may
include a decrease in the rate of replication of the cancerous cell(s), a
decrease in the total number of progeny cells that can be produced by said
cancerous cell(s), or a decrease in the number of times said cancerous
cell(s) can replicate, or the death of said cancerous cell(s).

Anti-neoplastic agents may also be identified using recombinant cells
suitably engineered to contain and express the cancer-related genes
disclosed herein. In one such embodiment, a recombinant cell is formed using
standard technology and then utilized in the assays disclosed herein. Methods
of forming such recombinant cells are well known in the literature. See, for
example, Sambrook, et al., Molecular Cloning: A Laboratory Manual, Second
Edition, Cold Spring Harbor, N.Y., (1989), Wu et al., Methods in Gene
Biotechnology (CRC Press, New York, NY, 1997), and Recombinant Gene
Expression Protocols, in Methods in Molecular Biology, Vol. 62, (Tuan, ed.,
Humana Press, Totowa, NJ, 1997), the disclosures of which are hereby
incorporated by reference.

The present invention also relates to a method for detecting the
cancerous status of a cell, comprising detecting elevated copy number and/or
expression in said cell of at least one gene that maps to the chromosomal
region of interest as identified in Table 1 (SEQ ID NO: 1 - 923). Such
elevated expression may be readily monitored by comparison to that of
otherwise normal cells having the same genes. Elevated expression of these
genes is indicative of the cancerous state. This includes a gene
corresponding to a polynucleotide that comprises a nucleotide sequence as
identified in Table 1 (SEQ ID NO: 1 - 923). Such elevated expression,
including increased copy number, may be the expression of more than one
such gene.

The present invention also relates to a method for detecting a
cancer-linked gene comprising the steps of contacting a compound identified
as having gene modulating activity for a gene corresponding to a
polynucleotide that comprises a nucleotide sequence as identified in Table 1
(SEQ ID NO: 1 - 923) with a cell expressing a test gene and detecting
modulation, such as decreased activity, of such test gene relative to when
said compound is not present, thereby identifying said test gene as a
cancer-related gene. In preferred embodiments, the gene determined by said
method is an oncogene, or cancer facilitating gene.

In another embodiment, there is provided a method for treating cancer
comprising contacting a cancerous cell with an agent first identified as having
gene modulating activity using any of the assay methods disclosed according
to the invention and in an amount effective to reduce the cancerous activity of
said cell. In a preferred embodiment, the cancerous cell is contacted in vivo.
In other preferred embodiments, said reduction in cancerous activity is a
decrease in the rate of proliferation of said cancerous cell, or said reduction in
cancerous activity is the death of said cancerous cell.

The present invention further relates to a method for treating cancer
comprising contacting a cancerous cell with an agent having activity against
an expression product encoded by a gene corresponding to a polynucleotide
comprising a nucleotide sequence as identified in Table 1 (SEQ ID NO: 1 -
923) where the product is a polypeptide, most preferably one comprising an
amino acid sequence as identified in Table 1 (SEQ ID NO: 806 - 854). In a
preferred embodiment, said cancerous cell is contacted in vivo. In another
preferred embodiment, the agent is an antibody.

As noted, the genes useful in the assay methods include genes
mapping within chromosomal regions of interest and genes as identified in
Table 1 (SEQ ID NO: 1 - 923), or a gene that encodes the same RNA, such
as the same messenger RNA, whose corresponding cDNA is one of the
sequences as identified in Table 1 (SEQ ID NO: 1 - 923). The genes useful in
the methods of the invention further include genes encoding RNAs whose
corresponding cDNA is at least 90% identical to a sequence as identified in
Table 1 (SEQ ID NO: 1 - 923), preferably at least about 95% identical to such
a sequence, more preferably at least about 98% identical to such sequence
and most preferably one comprising that sequence. All such genes are
specifically contemplated by all of the methods of the present invention.

In addition, sequences encoding the same proteins (SEQ ID NO: 806 - 
854) as any of these sequences, regardless of the percent identity of such 
sequences, are also specifically contemplated by the invention. 

The sequences disclosed herein may be genomic in nature and thus
represent the sequence of an actual gene, such as a human gene, or may be
a cDNA sequence derived from a messenger RNA (mRNA) and thus
represent contiguous exonic sequences derived from a corresponding
genomic sequence, or they may be wholly synthetic in origin for purposes of
testing. As described in the Example, the expression of these cancer-related
genes is determined from the relative expression levels of the RNA
complement of a cancerous cell relative to a normal (i.e., non-cancerous) cell.
Because of the processing that may take place in transforming the initial RNA
transcript into the final mRNA, the sequences disclosed herein may represent
less than the full genomic sequence. They may also represent sequences
derived from ribosomal and transfer RNAs. Consequently, the genes present
in the cell (and representing the genomic sequences) and the sequences
disclosed herein, which are mostly cDNA sequences, may be identical or may
be such that the cDNAs contain less than the full genomic sequence. Such
genes and cDNA sequences are still considered corresponding sequences
because they both encode similar RNA sequences. Thus, by way of
non-limiting example only, a gene that encodes an RNA transcript, which is
then processed into a shorter mRNA, is deemed to encode both such RNAs and
therefore encodes an RNA complementary to (using the usual Watson-Crick
complementarity rules), or that would otherwise be encoded by, a cDNA (for
example, a sequence as disclosed herein). Thus, the sequences disclosed
herein correspond to genes contained in the cancerous or normal cells used
to determine relative levels of expression because they represent the same
sequences or are complementary to RNAs encoded by these genes. Such
genes also include different alleles and splice variants that may occur in the
cells used in the methods of the invention.

The genes of the invention "correspond to" a polynucleotide having a
sequence as identified in Table 1 (SEQ ID NO: 1 - 923) if the gene encodes
an RNA (processed or unprocessed, including naturally occurring splice
variants and alleles) that is at least 90% identical, preferably at least 95%
identical, most preferably at least 98% identical to, and especially identical to,
an RNA that would be encoded by, or be complementary to, such as by
hybridization with, a polynucleotide having the indicated sequence. In
addition, genes including sequences at least 90% identical to a sequence as
identified in Table 1 (SEQ ID NO: 1 - 923), preferably at least about 95%
identical to such a sequence, more preferably at least about 98% identical to
such sequence and most preferably comprising such sequence are
specifically contemplated by all of the methods of the present invention as
being genes that correspond to these sequences. In addition, sequences
encoding the same proteins as any of these sequences, regardless of the
percent identity of such sequences, are also specifically contemplated by any
of the methods of the present invention that rely on any or all of said
sequences, regardless of how they are otherwise described or limited. Thus,
any such sequences are available for use in carrying out any of the methods
disclosed according to the invention. Such sequences also include any open
reading frames, as defined herein, present within any of the sequences as
identified in Table 1 (SEQ ID NO: 1 - 805 and 855 - 923).

Further in accordance with the present invention, the term "percent
identity" or "percent identical," when referring to a sequence, means that a
sequence is compared to a claimed or described sequence after alignment of
the sequence to be compared (the "Compared Sequence") with the described
or claimed sequence (the "Reference Sequence"). The Percent Identity is
then determined according to the following formula:



Percent Identity = 100[1-(C/R)] 



wherein C is the number of differences between the Reference Sequence and
the Compared Sequence over the length of alignment between the Reference
Sequence and the Compared Sequence wherein (i) each base or amino acid
in the Reference Sequence that does not have a corresponding aligned base
or amino acid in the Compared Sequence and (ii) each gap in the Reference
Sequence and (iii) each aligned base or amino acid in the Reference
Sequence that is different from an aligned base or amino acid in the
Compared Sequence, constitutes a difference; and R is the number of bases
or amino acids in the Reference Sequence over the length of the alignment
with the Compared Sequence with any gap created in the Reference
Sequence also being counted as a base or amino acid.

If an alignment exists between the Compared Sequence and the
Reference Sequence for which the percent identity as calculated above is
about equal to or greater than a specified minimum Percent Identity then the
Compared Sequence has the specified minimum percent identity to the
Reference Sequence even though alignments may exist in which the
hereinabove calculated Percent Identity is less than the specified Percent
Identity.
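
The Percent Identity formula above can be illustrated with a short sketch. It
assumes the two sequences have already been aligned by some other tool and
are supplied as equal-length strings in which "-" marks a gap; the function
name, the gap symbol and the example sequences are illustrative assumptions
and are not taken from the disclosure.

```python
def percent_identity(reference: str, compared: str) -> float:
    """Percent Identity = 100[1 - (C/R)] for two pre-aligned sequences.

    Both strings must have the same length (the alignment length), with
    '-' marking a gap. Assumes no column holds a gap in both rows.
    """
    if len(reference) != len(compared):
        raise ValueError("aligned sequences must have equal length")
    # R: positions in the Reference Sequence over the alignment, with any
    # gap created in the Reference Sequence also counted as a position.
    r = len(reference)
    if r == 0:
        raise ValueError("alignment is empty")
    # C: every aligned column in which the two rows differ. This covers
    # (i) a Reference residue with no aligned partner, (ii) a gap in the
    # Reference Sequence, and (iii) a mismatched residue.
    c = sum(1 for a, b in zip(reference, compared) if a != b)
    # 100 * (R - C) / R is algebraically the same as 100 * (1 - C/R).
    return 100.0 * (r - c) / r


if __name__ == "__main__":
    # One mismatch and one unpaired Reference base over 10 columns.
    print(percent_identity("ACGTACGTAC", "ACGTTCG-AC"))
```

Under the rule in the preceding paragraph, a Compared Sequence meets a
specified minimum if any one alignment scores at or above it, so in practice
such a function would be applied to the best available alignment.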

As used herein, the terms "portion," "segment," and "fragment," when
used in relation to polypeptides, refer to a continuous sequence of residues,
such as amino acid residues, which sequence forms a subset of a larger
sequence. For example, if a polypeptide were subjected to treatment with any
of the common endopeptidases, such as trypsin or chymotrypsin, the
oligopeptides resulting from such treatment would represent portions,
segments or fragments of the starting polypeptide. When used in relation to a
polynucleotide, such terms refer to the products produced by treatment of said
polynucleotides with any of the common endonucleases, or any stretch of
polynucleotides that could be produced synthetically.
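
As an illustration of the endopeptidase example above, the following sketch
performs an in silico trypsin digest. It assumes the commonly used
simplified rule that trypsin cleaves on the C-terminal side of lysine (K) or
arginine (R) unless the next residue is proline (P); the function name and
the test peptide are hypothetical.

```python
import re
from typing import List


def tryptic_fragments(protein: str) -> List[str]:
    """Split a one-letter amino acid sequence at simplified trypsin sites.

    Cleavage is placed after K or R, except when the following residue is P.
    Each returned string is a contiguous "fragment" of the input.
    """
    # Zero-width split: look behind for K/R, look ahead for anything but P.
    pieces = re.split(r"(?<=[KR])(?!P)", protein)
    # A site at the very end of the sequence yields an empty trailing piece.
    return [piece for piece in pieces if piece]


if __name__ == "__main__":
    for fragment in tryptic_fragments("AAKBBRPCCKDD"):
        print(fragment)
```

Joining the returned fragments in order reproduces the starting sequence,
which reflects the definition that each fragment is a contiguous subset of
the larger sequence.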

As used herein, the term "DNA segment" or "DNA sequence" refers to
a DNA polymer, in the form of a separate fragment or as a component of a
larger DNA construct, which has been derived from DNA, and may include
both single stranded and duplex sequences. Such segments are provided in
the form of an open reading frame uninterrupted by internal non-translated
sequences, or introns, which are typically present in eukaryotic genes.

The term "coding region" refers to that portion of a gene which either 
naturally or normally codes for the expression product of that gene in its 
natural genomic environment, i.e., the region coding in vivo for the native 
expression product of the gene. 

The term "nucleotide sequence" refers to a heteropolymer of
deoxyribonucleotides. Generally, DNA segments encoding the proteins
provided by this invention are assembled from cDNA fragments and short
oligonucleotide linkers, or from a series of oligonucleotides, to provide a
synthetic gene which is capable of being expressed in a recombinant
transcriptional unit comprising regulatory elements derived from a microbial or
viral operon.

The term "expression product" means that polypeptide or protein that is
the natural translation product of the gene and any nucleic acid sequence
coding equivalents resulting from genetic code degeneracy and thus coding
for the same amino acid(s).

The term "fragment," when referring to a coding sequence, means a
portion of DNA comprising less than the complete coding region whose
expression product retains essentially the same biological function or activity
as the expression product of the complete coding region.

The present invention also finds use as a means of diagnosing the
presence of cancer in a patient, as where a sample of cancerous tissues or
cells, or of tissues or cells suspected of being cancerous, is examined. For
such purposes, and in accordance with the disclosure elsewhere herein, such
diagnosis is based on the detection of elevated expression or amplification,
such as elevated copy number, of one or more of the genes identified
according to the invention. Such elevated expression can be determined by
any of the means described herein.

In one such embodiment, the elevated expression, as compared to
normal cells and/or tissues of the same organ, is determined by measuring
the relative rates of transcription of RNA, such as by production of
corresponding cDNAs and then analyzing the resulting DNA using probes
developed from the gene sequences as identified in Table 1. Thus, use of
reverse transcriptase with the full RNA complement of a cell suspected of
being cancerous produces a corresponding amount of cDNA that can then be
amplified using polymerase chain reaction, or some other means, such as
rolling circle amplification, to determine the relative levels of resulting
cDNA and, thereby, the relative levels of gene expression.

For RNA analysis, the RNA may be isolated from samples in a variety
of ways, including lysis and denaturation with a phenolic solution containing a
chaotropic agent (e.g., TRIzol) followed by isopropanol precipitation, ethanol
wash, and resuspension in aqueous solution; or lysis and denaturation
followed by isolation on solid support, such as a Qiagen resin and
reconstitution in aqueous solution; or lysis and denaturation in non-phenolic,
aqueous solutions followed by enzymatic conversion of RNA to DNA template
copies. Steady state RNA levels for a given type of cell or tissue may have to
be ascertained prior to employment of the methods of the invention but such
is well within the skill of those in the art and will not be further described in
detail herein.
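
The comparison of cDNA levels from a suspect sample against a normal
baseline can be sketched as a simple ratio. This is only an illustration:
the signal values, the gene labels and the 2-fold cut-off are assumptions
chosen for the example, and the disclosure does not specify a particular
threshold or normalization.

```python
from typing import Dict, Tuple


def relative_expression(
    suspect: Dict[str, float],
    normal: Dict[str, float],
    min_fold: float = 2.0,  # hypothetical cut-off, not from the disclosure
) -> Dict[str, Tuple[float, bool]]:
    """Return {gene: (fold_change, is_elevated)} for suspect vs. normal.

    Inputs map a gene label to a measured cDNA signal (arbitrary units)
    from the same organ or tissue. Genes with no usable normal baseline
    are skipped rather than guessed.
    """
    result: Dict[str, Tuple[float, bool]] = {}
    for gene, suspect_signal in suspect.items():
        baseline = normal.get(gene)
        if baseline is None or baseline <= 0:
            continue  # steady state level not ascertained for this gene
        fold = suspect_signal / baseline
        result[gene] = (fold, fold >= min_fold)
    return result


if __name__ == "__main__":
    suspect = {"geneA": 400.0, "geneB": 90.0, "geneC": 50.0}
    normal = {"geneA": 100.0, "geneB": 100.0}
    for gene, (fold, elevated) in relative_expression(suspect, normal).items():
        print(gene, fold, elevated)
```

Skipping genes without a baseline mirrors the point made above that steady
state RNA levels for the relevant cell or tissue type may have to be
established first.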

Alternatively, increased expression, such as increased copy number,
may be determined for the genes present in a cancerous cell, or a cell
suspected of being cancerous, by using the nucleotide sequences as
identified in Table 1 as a means of generating probes for the DNAs present in
the cells to be examined. Thus, the DNA of such cells may be extracted and
probed using the sequences disclosed herein for the presence in the
genomes of such cells of increased amounts of one or more of the genes of
the invention. For example, where a cancer-related, or cancer-linked, gene as
disclosed herein is found to be present in multiple copies within the genome of
a cell, even where it may not be actively being over-expressed at the time of
such determination, this may be indicative of at least a disposition toward
developing cancer at a subsequent time.

In accordance with the foregoing, the presence of such multiple copies
of a gene, or genes, as disclosed herein may be determined using Northern or
Southern blotting and employing the sequences as identified in Table 1 to
develop probes for this purpose. Such probes may be composed of DNA or
RNA and may advantageously be comprised of a contiguous stretch of
nucleotide residues matching, or complementary to, a sequence as identified
in Table 1. Such probes will most usefully comprise a contiguous stretch of at
least 15, preferably at least 30, more preferably at least 50, most preferably at
least 80, and especially at least 100, even 200 residues, derived from one or
more of the sequences as identified in Table 1. Thus, where a single probe
binds multiple times to the genome of a sample of cells that are cancerous, or
are suspected of being cancerous, or predisposed to become cancerous,
whereas binding of the same probe to a similar amount of DNA derived from
the genome of otherwise non-cancerous cells of the same organ or tissue
results in observably less binding, this is indicative of the presence of multiple
copies of a gene comprising, or corresponding to, the sequence as identified
in Table 1 from which the probe sequence was derived.
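
A minimal sketch of deriving candidate probes as contiguous stretches of a
source sequence, together with their complementary strands, is given below.
The function names and the example sequence are hypothetical; the only
figure taken from the text is the minimum length of 15 residues. Real probe
design would also weigh melting temperature and specificity, which this
sketch does not attempt.

```python
from typing import List

MIN_PROBE_LENGTH = 15  # smallest contiguous stretch named in the text

_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(dna: str) -> str:
    """Return the Watson-Crick complementary strand, read 5' to 3'."""
    return dna.translate(_COMPLEMENT)[::-1]


def contiguous_probes(dna: str, length: int = MIN_PROBE_LENGTH,
                      step: int = 1) -> List[str]:
    """List every contiguous window of `length` residues taken from `dna`."""
    if length < MIN_PROBE_LENGTH:
        raise ValueError("probe shorter than the 15-residue minimum")
    if step < 1:
        raise ValueError("step must be at least 1")
    return [dna[i:i + length] for i in range(0, len(dna) - length + 1, step)]


if __name__ == "__main__":
    source = "ACGTACGTACGTACGTAC"  # placeholder, not a Table 1 sequence
    for probe in contiguous_probes(source):
        print(probe, reverse_complement(probe))
```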

Increased expression may also be determined using agents that
selectively bind to, and thereby detect, the presence of expression products of
the genes disclosed herein. For example, an antibody, possibly a suitably
labeled antibody, such as where the antibody is bound to a fluorescent or
radiolabel, may be generated against one of the polypeptides comprising a
sequence as identified in Table 1 (serial number 1-229 (breast), 230-440
(colon), 441-656 (lung) and 657-805 (prostate); for polypeptide SEQ ID NOs,
see Table 1, serial number 806-923 (transcript or protein)), and said antibody
will then react with, and bind either selectively or specifically to, a polypeptide
encoded by one of the genes that corresponds to a sequence disclosed
herein. Such antibody binding, especially relative extent of such binding in
samples derived from suspected cancerous, as opposed to otherwise
non-cancerous, cells and tissues, can then be used as a measure of the extent
of expression, or over-expression, of the cancer-related genes identified herein.
Thus, the genes identified herein as being over-expressed in cancerous cells
and tissues may be over-expressed due to increased copy number, or due to
over-transcription, such as where the over-expression is due to
over-production of a transcription factor that activates the gene and leads to
repeated binding of RNA polymerase, thereby generating larger than normal
amounts of RNA transcripts, which are subsequently translated into
polypeptides, such as the polypeptides comprising amino acid sequences as
identified in Table 1 (SEQ ID NO: 1 - 923). Such analysis provides an
additional means of ascertaining the expression of the genes identified
according to the invention and thereby determining the presence of a
cancerous state in a sample derived from a patient to be tested, or the
predisposition to develop cancer at a subsequent time in said patient.

In employing the methods of the invention, it should be borne in mind
that gene expression indicative of a cancerous state need not be
characteristic of every cell found to be cancerous. Thus, the methods
disclosed herein are useful for detecting the presence of a cancerous
condition within a tissue where less than all cells exhibit the complete pattern
of over-expression. For example, a set of selected genes, comprising
sequences homologous under stringent conditions, or at least 90%, preferably
95%, identical to at least one of the sequences as identified in Table 1, may
be found, using appropriate probes, either DNA or RNA, to be present in as
little as 60% of cells derived from a sample of tumorous, or malignant, tissue
while being absent from as much as 60% of cells derived from corresponding
non-cancerous, or otherwise normal, tissue (and thus being present in as
much as 40% of such normal tissue cells). In a preferred embodiment, such
gene pattern is found to be present in at least 70% of cells drawn from a
cancerous tissue and absent from at least 70% of a corresponding normal,
non-cancerous, tissue sample. In an especially preferred embodiment, such
gene pattern is found to be present in at least 80% of cells drawn from a
cancerous tissue and absent from at least 80% of a corresponding normal,
non-cancerous, tissue sample. In a most preferred embodiment, such gene
pattern is found to be present in at least 90% of cells drawn from a cancerous
tissue and absent from at least 90% of a corresponding normal,
non-cancerous, tissue sample. In an additional embodiment, such gene pattern
is found to be present in at least 100% of cells drawn from a cancerous tissue
and absent from at least 100% of a corresponding normal, non-cancerous,
tissue sample, although the latter embodiment may represent a rare
occurrence.
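
The graded thresholds above (60%, 70%, 80%, 90% and 100%) can be read as
tiers that require both conditions at once: presence in cancerous cells and
absence from normal cells. The following sketch returns the highest tier a
pair of observed percentages satisfies. The function name and the example
percentages are illustrative assumptions; only the tier values come from the
text.

```python
from typing import Optional

# Tier values named in the text, highest first.
TIERS = (100, 90, 80, 70, 60)


def pattern_tier(present_in_cancer_pct: float,
                 absent_in_normal_pct: float) -> Optional[int]:
    """Return the highest tier met by BOTH percentages, or None.

    present_in_cancer_pct: percent of cancerous-tissue cells showing the
        gene pattern.
    absent_in_normal_pct: percent of corresponding normal-tissue cells
        lacking the pattern.
    """
    for pct in (present_in_cancer_pct, absent_in_normal_pct):
        if not 0 <= pct <= 100:
            raise ValueError("percentages must lie between 0 and 100")
    for tier in TIERS:
        if present_in_cancer_pct >= tier and absent_in_normal_pct >= tier:
            return tier
    return None


if __name__ == "__main__":
    # 85% of tumor cells show the pattern; 72% of normal cells lack it.
    print(pattern_tier(85.0, 72.0))
```

Because both conditions must hold, the lower of the two percentages decides
the tier.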

In an additional aspect, the present invention relates to a method for
determining a cancer initiating or facilitating gene comprising contacting a cell
expressing a test gene (i.e., a gene whose status as a cancer initiating or
facilitating gene is to be determined) with an agent that decreases the
expression of a gene that encodes an RNA at least 90%, preferably 95%,
identical to an RNA encoded by (i.e., a gene corresponding to) a
polynucleotide comprising, or having, a sequence selected from the
sequences identified in Table 1, and detecting a decrease in expression of
said test gene compared to when said agent is not present, thereby identifying
said test gene as being a cancer initiating or facilitating gene. Such genes
may, of course, be oncogenes and said decrease in expression may be due
to a decrease in copy number of said gene in said cell or a cell derived from
said cell, such as where copy number is reduced in the cells formed by
replication of such cells.

Thus, some or all of the genes disclosed herein as corresponding to the
sequences identified in Table 1 are found to play a direct role in the initiation
or progression of cancer or even other diseases and disease processes.
Because changes in expression of these genes (up-regulation) are linked to
the disease state (i.e. cancer), the change in expression may contribute to the
initiation or progression of the disease. For example, if a gene that is
up-regulated is an oncogene, such a gene provides a means of screening for
small molecule therapeutics beyond screens based upon expression output
alone. For example, genes that display up-regulation in cancer and whose
elevated expression contributes to initiation or progression of disease
represent targets in screens for small molecules that inhibit or block their
function. Examples include, but are not limited to, kinase inhibition, cellular
proliferation, substrate analogs that block the active site of protein targets, etc.

It should be noted that there are a variety of different contexts in which
genes have been evaluated as being involved in the cancerous process.
Thus, some genes may be oncogenes and encode proteins that are directly
involved in the cancerous process and thereby promote the occurrence of
cancer in an animal. Other genes may simply be involved either directly or
indirectly in the cancerous process or condition and may serve in an ancillary
capacity with respect to the cancerous state. All such types of genes are
deemed to be within those to be determined in accordance with the invention
as disclosed herein. Thus, the gene determined by said method of the invention
may be an oncogene, or the gene determined by said method may be a
cancer facilitating gene, the latter including a gene that directly or indirectly
affects the cancerous process, either in the promotion of a cancerous
condition or in facilitating the progress of cancerous growth or otherwise
modulating the growth of cancer cells, either in vivo or ex vivo. Such genes
may work indirectly where their expression alters the activity of some other
gene or gene expression product that is itself directly involved in initiating or
facilitating the progress of a cancerous condition. For example, a gene that
encodes a polypeptide, either wild or mutant in type, which polypeptide acts to
suppress a tumor suppressor gene, or its expression product, will thereby act
indirectly to promote tumor growth.

In accordance with the foregoing, the method of the present invention
includes cancer modulating agents that are themselves either polypeptides, or
small chemical entities, that affect the cancerous process, including initiation,
suppression or facilitation of tumor growth, either in vivo or ex vivo. Such
agents may also be antibodies that react with one or more of the polypeptides
as identified in Table 1 (SEQ ID NO: 806-923 (transcript or protein)).

In keeping with the disclosure herein, the present invention also relates
to a method for treating cancer comprising contacting a cancerous cell with an
agent having activity against an expression product encoded by a gene
mapping within regions of chromosomal interest or, alternatively, a gene
corresponding to a polynucleotide that comprises a nucleotide sequence as
identified in Table 1, such as where such expression product is one of the
polypeptides as identified in Table 1.

WO 2004/091548 PCT/US2004/009289

The method of the present invention includes embodiments of the
above-recited method wherein said cancer cell is contacted in vivo as well as
ex vivo, preferably wherein said agent comprises a portion, or is part of an
overall molecular structure, having affinity for said expression product. In one
such embodiment, said portion having affinity for said expression product is
an antibody.

In one embodiment of the present invention, a chemical agent, such as
a protein or other polypeptide, is joined to an agent, such as an antibody,
having affinity for an expression product of a cancerous cell, such as a
polypeptide or protein encoded by a gene related to the cancerous process,
especially a gene sequence corresponding to one of the cDNA sequences as
identified in Table 1. In a specific embodiment, said expression product acts
as a therapeutic target for the affinity portion of said anticancer agent and
where, after binding of the affinity portion of such agent to the expression
product, the anti-cancer portion of said agent acts against said expression
product so as to neutralize its effects in initiating, facilitating or promoting
tumor formation and/or growth. In a separate embodiment of the present
invention, binding of the agent to said expression product may, without more,
have the effect of deterring cancer promotion, facilitation or growth, especially
where the presence of said expression product is related, either intimately or
only in an ancillary manner, to the development and growth of a tumor. Thus,
where the presence of said expression product is essential to tumor initiation
and/or growth, binding of said agent to said expression product will have the
effect of negating said tumor promoting activity. In one such embodiment, said
agent is an apoptosis-inducing agent that induces cell suicide, thereby killing
the cancer cell and halting tumor growth.

Many cancers contain chromosomal rearrangements, which typically
represent translocations, amplifications, or deletions of specific regions of
genomic DNA. A recurrent chromosomal rearrangement that is associated
with a specific stage and type of cancer always affects a gene (or possibly
genes) that plays a direct and critical role in the initiation or progression of the
disease. Many of the known oncogenes or tumor suppressor genes that play
direct roles in cancer have either been initially identified based upon their
positional cloning from a recurrent chromosomal rearrangement or have been
demonstrated to fall within a rearrangement subsequent to their cloning by
other methods. In all cases, such genes display amplification at both the level
of DNA copy number and at the level of transcriptional expression at the
mRNA level.

The present invention also relates to a method for determining
functionally related genes comprising contacting one or more gene sequences
corresponding to the cDNAs as identified in Table 1 with an agent that
modulates expression of more than one gene in such group and thereby
determining a subset of genes of said group.
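
By way of illustration only, the subset determination described above may be sketched in Python. The sketch assumes that expression of each gene has already been measured with and without the agent; the gene labels, the two-fold cutoff and the function name `comodulated_subset` are hypothetical and are not taken from the disclosure.

```python
from typing import Dict, Set


def comodulated_subset(
    untreated: Dict[str, float],
    treated: Dict[str, float],
    min_fold: float = 2.0,  # assumed cutoff; the text does not specify one
) -> Set[str]:
    """Return genes whose expression changes at least min_fold (up or down)."""
    subset = set()
    for gene, base in untreated.items():
        if gene not in treated or base <= 0 or treated[gene] <= 0:
            continue  # skip genes without a usable pair of measurements
        ratio = treated[gene] / base
        if ratio >= min_fold or ratio <= 1.0 / min_fold:
            subset.add(gene)
    # The method refers to an agent modulating more than one gene, so a
    # single responding gene is not reported as a related group.
    return subset if len(subset) > 1 else set()
```

Genes returned together are only candidates for functional relatedness; whether they share a pathway or a transcriptional activator would still have to be established separately.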

In accordance with the present invention, said functionally related
genes are genes modulating the same metabolic pathway or said genes are
genes encoding functionally related polypeptides. In one such embodiment,
said genes are genes whose expression is modulated by the same
transcriptional activator or enhancer sequence, especially where said
transcriptional activator or enhancer increases, or otherwise modulates, the
activity of a gene corresponding to a cDNA as identified in Table 1.

The present invention also relates to a process that comprises a
method for producing a product comprising identifying an agent according to
one of the disclosed methods for identifying such an agent (i.e., the
therapeutic agents identified according to the assay procedures disclosed
herein) wherein said product is the data collected with respect to said agent
as a result of said identification process, or assay, and wherein said data is
sufficient to convey the chemical character and/or structure and/or properties
of said agent. For example, the present invention specifically contemplates a
situation whereby a user of an assay of the invention may use the assay to
screen for compounds having the desired enzyme modulating activity and,
having identified the compound, then conveys that information (i.e.,
information as to structure, dosage, etc.) to another user who then utilizes
the information to reproduce the agent and administer it for therapeutic or
research purposes according to the invention. For example, the user of the
assay (user 1) may screen a number of test compounds without knowing the
structure or identity of the compounds (such as where code numbers are used
and the first user is simply given samples labeled with said code numbers)
and, after performing the screening process, using one or more assay
processes of the present invention, then imparts to a second user (user 2),
verbally or in writing or some equivalent fashion, sufficient information to
identify the compounds having a particular modulating activity (for example,
the code number with the corresponding results). This transmission of
information from user 1 to user 2 is specifically contemplated by the present
invention.
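
As a non-limiting sketch of the exchange between user 1 and user 2, the following Python fragment shows a blinded screen in which only code numbers and results are reported, and the holder of the key recovers the compound identities. The class and function names, the code-number format and the 0.5 fold-change cutoff are all hypothetical and serve only to illustrate the flow of information.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass(frozen=True)
class ScreenResult:
    code_number: str    # label on the blinded sample given to user 1
    fold_change: float  # treated / untreated expression of the target gene


def report_hits(results: List[ScreenResult], max_fold: float = 0.5) -> List[Dict[str, object]]:
    """User 1: report code numbers whose expression fell to max_fold or below."""
    return [
        {"code_number": r.code_number, "fold_change": r.fold_change}
        for r in results
        if r.fold_change <= max_fold
    ]


def decode_hits(report: List[Dict[str, object]], key: Dict[str, str]) -> List[str]:
    """User 2: map the reported code numbers back to compound identities."""
    return [key[e["code_number"]] for e in report if e["code_number"] in key]
```

User 1 never needs the key, so the report alone is sufficient for user 2 to identify the active compounds.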

In accordance with the foregoing, the present invention relates to a
method for producing test data with respect to the anti-neoplastic activity
of a compound comprising:

(a) contacting a compound with a cell that expresses at least one gene
corresponding to a polynucleotide comprising a nucleotide sequence of serial
number 1-229 (breast), 230-440 (colon), 441-656 (lung) and 657-805
(prostate) of Table 1 or encoding a polypeptide or transcript of SEQ ID NO:
806-923 and under conditions promoting expression of said gene;

(b) detecting a change in expression of said gene compared to
expression when said compound is not present; and

(c) producing test data with respect to the gene modulating activity of
said compound based on a change in the expression of the determined gene,
or genes, whose expression is otherwise elevated in a non-cancerous cell
over that in a cancerous cell and a decrease in the expression of the
determined gene, or genes, whose expression is otherwise increased in a
cancerous cell over that in a non-cancerous cell indicating anti-neoplastic
activity.
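
The direction rule of step (c) may be illustrated with a short Python sketch. It rests on two assumptions not stated in the text: that the "change" for genes normally elevated in non-cancerous cells is read as an increase, and that every measured gene must move in the favorable direction. The function name and gene labels are hypothetical.

```python
from typing import Dict


def indicates_antineoplastic(
    fold_change: Dict[str, float],        # treated / untreated expression
    elevated_in_cancer: Dict[str, bool],  # True if the gene is higher in cancer
) -> bool:
    """True when every measured gene moves toward its non-cancerous level."""
    if not fold_change:
        return False
    for gene, fc in fold_change.items():
        if elevated_in_cancer[gene]:
            if fc >= 1.0:  # a gene over-expressed in cancer should decrease
                return False
        else:
            if fc <= 1.0:  # a gene under-expressed in cancer should increase
                return False
    return True
```

A less strict variant could require only a majority of genes to respond; the all-genes rule is used here simply because it is the easiest to state.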

In another embodiment, the present invention provides a method for
monitoring the progress of a cancer treatment, such as where the methods of
the invention permit a determination that a given course of cancer therapy is
or is not proving effective because of an increased or decreased expression
of a gene, or genes, disclosed herein. For example, where there is an
increased copy number of one or more of the genes as identified in Table 1
(SEQ ID NO: 1 - 805), monitoring of such genes can predict success or
failure of a course of therapy, such as chemotherapy, or predict the likelihood
of a relapse based on elevated activity or expression of one or more of these
genes following such course of therapy.

In accordance with the foregoing, the present invention contemplates a
method for determining the progress of a treatment for cancer in a patient
afflicted with cancer, following commencement of a cancer treatment on said
patient, comprising:

(a) determining in said patient a change in expression of one or more
genes corresponding to a polynucleotide comprising a nucleotide sequence of
serial number 1-229 (breast), 230-440 (colon), 441-656 (lung) and 657-805
(prostate) of Table 1 or encoding a polypeptide or transcript of serial number
806-923 of Table 1 (which include any of SEQ ID NO: 1 - 923) and under
conditions promoting expression of said one or more genes; and

(b) detecting a change in expression of said gene compared to
expression of said one or more determined genes prior to commencement of
said cancer treatment;

thereby determining the progress of said treatment.
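
The comparison in steps (a) and (b) may be sketched as follows. This is a minimal illustration that assumes expression values for the same genes are available before and after commencement of treatment; the function names and the use of a fractional change are assumptions, not part of the disclosed method.

```python
from typing import Dict


def expression_changes(before: Dict[str, float], after: Dict[str, float]) -> Dict[str, float]:
    """Per-gene fractional change, (after - before) / before, for shared genes."""
    return {
        gene: (after[gene] - before[gene]) / before[gene]
        for gene in before
        if gene in after and before[gene] > 0
    }


def fraction_decreased(changes: Dict[str, float]) -> float:
    """Share of monitored genes whose expression fell after treatment began."""
    if not changes:
        return 0.0
    return sum(1 for c in changes.values() if c < 0) / len(changes)
```

The fraction of genes showing decreased expression gives one simple summary of progress across a panel of several genes.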
In a preferred embodiment, the detected change in expression is a
decrease in expression. In another preferred embodiment, the cancer
treatment is treatment with a chemotherapeutic agent, especially an agent
that modulates, preferably decreases, expression of a gene identified herein,
such as where said agent was first identified as having antineoplastic activity
using a method of the invention. Thus, in accordance with this aspect of the
present invention, a patient, or even a research animal, such as a mouse, rat,
rabbit or primate, afflicted with cancer, including a cancer induced for
research purposes, is introduced to a cancer treatment regimen, such as
administration of an anti-cancer agent, including one first identified as
having antineoplastic activity by one or more of the screening methods
disclosed herein. The progress and success or failure of such treatment is
subsequently ascertained by determining the subsequent expression of one or
more, preferably at least 3, or 5, or 10, of the genes identified herein, or
genes that encode a transcript or polypeptide disclosed herein (see Table 1),
following said treatment. In a preferred embodiment, a treatment that reduces
said expression is deemed advantageous and may then be the basis for
continuing said treatment. The methods of the invention thereby provide a
means of continually monitoring the success of the treatment and evaluating
both the need, and desirability, of continuing said treatment. In addition,
more than one said treatment may be administered simultaneously without
diminishing the value of the methods of the invention in determining the
overall success of such combined treatment. Thus, more than one said
antineoplastic agent may be administered to the same patient and overall
effectiveness ascertained by the recited methods.

In accordance with the foregoing, the present invention also
contemplates a method for determining the likelihood of survival of a patient
afflicted with cancer, following commencement of a cancer treatment on said
patient, comprising:

(a) determining in said patient a change in expression of one or more
genes corresponding to a polynucleotide comprising a nucleotide sequence of
serial number 1-229 (breast), 230-440 (colon), 441-656 (lung) and 657-805
(prostate) of Table 1 or encoding a polypeptide or transcript of serial number
806-923 of Table 1 and under conditions promoting expression of said one or
more genes; and

(b) detecting a change in expression of said gene compared to
expression of said one or more determined genes prior to commencement of
said cancer treatment;

thereby determining the likelihood of survival of said patient.

In a preferred embodiment, the detected change in expression is a
decrease in expression and said determined gene, or genes, may include 2,
3, 5, 10 or more of the genes described herein. Thus, the methods of the
invention may be utilized as a means for compiling cancer survival statistics
following one or more, possibly combined, treatments for cancer, in keeping
with the other methods disclosed herein.

The genes identified herein also offer themselves as pharmacodynamic
markers (or as pharmacogenetic and/or surrogate markers), such as for
patient profiling prior to clinical trials and/or targeted therapies, including
combination treatments, resulting from the identification of these genes as
valid gene targets for chemotherapy based on the screening procedures of
the invention. In one embodiment thereof, the likelihood of success of a
cancer treatment with a selected chemotherapeutic agent may be based on
the fact that such agent has been determined to have expression modulating
activity with one or more genes identified herein, especially where said genes
have been identified as showing elevated expression levels in samples from a
prospective patient afflicted with cancer. Methods described elsewhere herein
for determining cancerous status of a cell may find ready use in such
evaluations.
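
One way to picture such patient profiling is to compare the genes found elevated in a patient sample with the genes each candidate agent is known to modulate. The Python sketch below is illustrative only; the overlap count used for ranking, the function name and the agent and gene labels are assumptions introduced for the example.

```python
from typing import Dict, List, Set, Tuple


def rank_agents_for_patient(
    patient_elevated: Set[str],
    agent_targets: Dict[str, Set[str]],
) -> List[Tuple[str, int]]:
    """Rank agents by how many of the patient's elevated genes each modulates."""
    scored = [
        (agent, len(targets & patient_elevated))
        for agent, targets in agent_targets.items()
    ]
    # Highest overlap first; ties are broken alphabetically for a stable order.
    return sorted(scored, key=lambda item: (-item[1], item[0]))
```

An agent with no overlap would rank last, which corresponds to a low expected likelihood of success under this simplified view.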

It should be cautioned that, in carrying out the procedures of the
present invention as disclosed herein, any reference to particular buffers,
media, reagents, cells, culture conditions and the like is not intended to be
limiting, but is to be read so as to include all related materials that one of
ordinary skill in the art would recognize as being of interest or value in the
particular context in which that discussion is presented. For example, it is
often possible to substitute one buffer system or culture medium for another
and still achieve similar, if not identical, results. Those of skill in the art will
have sufficient knowledge of such systems and methodologies so as to be
able, without undue experimentation, to make such substitutions as will
optimally serve their purposes in using the methods and procedures disclosed
herein.

The present invention will now be further described by way of the
following non-limiting example. In applying the disclosure of the example, it
should be kept clearly in mind that other and different embodiments of the
methods disclosed according to the present invention will no doubt suggest
themselves to those of skill in the relevant art.

EXAMPLE

Cancerous cells that over-express one or more of the genes selected
from those that correspond to genes as identified in Table 1 (serial number
1-229 (breast), 230-440 (colon), 441-656 (lung) and 657-805 (prostate); serial
number 806-923 (transcript or protein), or SEQ ID NO: 1 - 805 and 855 - 923)
are grown to a density of 10⁵ cells/cm² in Leibovitz's L-15 medium
supplemented with 2 mM L-glutamine (90%) and 10% fetal bovine serum. The
cells are collected after treatment with 0.25% trypsin, 0.02% EDTA at 37°C for
2 to 5 minutes. The trypsinized cells are then diluted with 30 ml growth
medium and plated at a density of 50,000 cells per well in a 96 well plate
(200 µl/well). The following day, cells are treated with either compound
buffer alone, or compound buffer containing a chemical agent to be tested,
for 24 hours. The medium is then removed, the cells lysed and the RNA
recovered using the RNeasy reagents and protocol obtained from Qiagen. RNA is
quantitated and 10 ng of sample in 1 µl are added to 24 µl of TaqMan reaction
mix containing 1X PCR buffer, RNasin, reverse transcriptase, nucleoside
triphosphates, AmpliTaq Gold, Tween 20, glycerol, bovine serum albumin (BSA)
and specific PCR primers and probes for a reference gene (18S RNA) and a
test gene (Gene X). Reverse transcription is then carried out at 48°C for 30
minutes. The sample is then applied to a Perkin Elmer 7700 sequence
detector and heat denatured for 10 minutes at 95°C. Amplification is
performed through 40 cycles using 15 seconds annealing at 60°C followed by
a 60 second extension at 72°C and 30 second denaturation at 95°C. Data
files are then captured and the data analyzed with the appropriate baseline
windows and thresholds.
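
For convenience, the quantities stated in the protocol above can be checked with a few lines of Python. The sketch assumes the per-well volume and the reaction mix volume are in microliters and ignores ramp times between temperatures; the variable and function names are hypothetical.

```python
CELLS_PER_WELL = 50_000
WELL_VOLUME_UL = 200.0  # assumed to be microliters per well
WELLS = 96

# Concentration of the cell suspension needed to seed one well.
cells_per_ml = CELLS_PER_WELL * 1000 / WELL_VOLUME_UL
total_cells = CELLS_PER_WELL * WELLS
plate_volume_ml = WELLS * WELL_VOLUME_UL / 1000.0

# RT-PCR reaction: 1 ul of RNA (10 ng) plus 24 ul of reaction mix.
reaction_volume_ul = 1.0 + 24.0
rna_ng_per_ul = 10.0 / reaction_volume_ul

# Thermal profile as (step, temperature in deg C, seconds, repeats).
PROFILE = [
    ("reverse transcription", 48, 30 * 60, 1),
    ("initial denaturation", 95, 10 * 60, 1),
    ("annealing", 60, 15, 40),
    ("extension", 72, 60, 40),
    ("denaturation", 95, 30, 40),
]


def hold_time_minutes(profile) -> float:
    """Sum programmed hold times only; ramping between steps is ignored."""
    return sum(seconds * repeats for _, _, seconds, repeats in profile) / 60.0
```

Under these assumptions a full plate needs 19.2 ml of suspension at 250,000 cells/ml, which is consistent with diluting the trypsinized cells in 30 ml of growth medium.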



The quantitative difference between the target and reference genes is
then calculated and a relative expression value determined for all of the
samples used. This procedure is then repeated for each of the target genes in
a given signature, or characteristic, set and the relative expression ratios for
each pair of genes are determined (i.e., a ratio of expression is determined for
each target gene versus each of the other genes for which expression is
measured, where each gene's absolute expression is determined relative to
the reference gene for each compound, or chemical agent, to be screened).
The samples are then scored and ranked according to the degree of alteration
of the expression profile in the treated samples relative to the control. The
overall expression of the set of genes relative to the controls, as modulated by
one chemical agent relative to another, is also ascertained. Chemical agents
having the most effect on a given gene, or set of genes, are considered the
most anti-neoplastic.
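
The calculation described above may be sketched in Python as follows. The sketch assumes that the instrument reports a threshold cycle (Ct) for each gene, that amplification doubles the product each cycle, and that alteration is scored as the sum of absolute log2 fold changes; none of these specifics is stated in the example, and all function names are hypothetical.

```python
import math
from itertools import combinations
from typing import Dict, List, Tuple


def relative_expression(ct_target: float, ct_reference: float) -> float:
    """Expression relative to the reference gene, assuming doubling per cycle."""
    return 2.0 ** -(ct_target - ct_reference)


def pairwise_ratios(rel: Dict[str, float]) -> Dict[Tuple[str, str], float]:
    """Ratio of relative expression for each pair of target genes."""
    return {(a, b): rel[a] / rel[b] for a, b in combinations(sorted(rel), 2)}


def alteration_score(control: Dict[str, float], treated: Dict[str, float]) -> float:
    """Sum of absolute log2 fold changes across the gene set (assumed metric)."""
    return sum(abs(math.log2(treated[g] / control[g])) for g in control)


def rank_agents(
    control: Dict[str, float],
    by_agent: Dict[str, Dict[str, float]],
) -> List[Tuple[str, float]]:
    """Order agents from most to least alteration of the expression profile."""
    scores = [(agent, alteration_score(control, rel)) for agent, rel in by_agent.items()]
    return sorted(scores, key=lambda item: item[1], reverse=True)
```

Here `control` holds the relative expression values from cells treated with compound buffer alone, and `by_agent` holds the corresponding values for each chemical agent tested.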
WHAT IS CLAIMED IS:

1. A method for diagnosing cancer in a mammal, comprising
determining amplification of a gene in the genome of a mammal wherein said
gene is a gene of Table 1.

2. The method of claim 1 wherein said cancer is a member selected
from breast cancer, colon cancer, lung cancer, prostate cancer, ovarian
cancer, pancreatic cancer, cervical cancer and kidney cancer.

3. The method of claim 1 wherein said gene of Table 1 is a gene that
encodes the same gene product as a polynucleotide selected from the
polynucleotides of SEQ ID NO: 1 - 805 and 855 - 923.

4. The method of claim 1 wherein said mammal is a human patient.

5. A method for diagnosing cancer or a pre-cancerous condition in a
mammal, comprising:

(a) obtaining a cell or tissue sample from a mammal suspected of
having cancer or a pre-cancerous condition and determining for said sample
the gene copy number of a gene of Table 1;

(b) comparing said gene copy number of step (a) to the gene copy
number of the same gene from a sample of a corresponding cell or tissue
from a mammal of the same species not having cancer of the type being
diagnosed

whereby a higher gene copy number determined in step (a) relative
to that in step (b) indicates the presence of a cancer or pre-cancerous
condition in the mammal of step (a) and results in a diagnosis of cancer or a
pre-cancerous condition in said mammal.

6. The method of claim 5 wherein said mammal is a human patient.

7. The method of claim 5 wherein said cancer is a member selected
from breast cancer, colon cancer, lung cancer, prostate cancer, ovarian
cancer, pancreatic cancer, cervical cancer and kidney cancer.

8. The method of claim 5 wherein the gene of Table 1 is a gene that
encodes the same gene product as a polynucleotide of SEQ ID NO: 1 - 805
and 855 - 923.

9. A method of inhibiting cancer, or a pre-cancerous condition, in a
mammalian cell, comprising contacting said cell with a molecule that inhibits
function of a gene of Table 1.

10. The method of claim 9 wherein said gene of Table 1 is a gene that
encodes the same gene product as a polynucleotide of SEQ ID NO: 1 - 805
and 855 - 923.

11. The method of claim 9 wherein said molecule inhibits gene function
by binding to said gene.

12. The method of claim 9 wherein said molecule inhibits gene function
by binding to an RNA encoded by said gene.

13. The method of claim 9 wherein said molecule inhibits gene function
by binding to polypeptide encoded by said gene.

14. The method of claim 9 wherein said molecule is a member selected
from an antisense DNA, an antisense RNA, a ribozyme and an siRNA.

15. The method of claim 9 wherein said cancer is a member selected
from breast cancer, colon cancer, lung cancer, prostate cancer, ovarian
cancer, pancreatic cancer, cervical cancer and kidney cancer.

16. The method of claim 9 wherein said contacting occurs in vivo.






WO 2004/091548 



PCT/US2004/009289



17. A method for identifying an agent having therapeutic activity in a
human patient in need of said therapeutic activity, comprising:

(a) determining in a sample from a patient the level of a gene product
encoded by a gene of Table 1 prior to administering a test compound to said
patient;

(b) administering said test compound to said patient;

(c) determining in a sample from said patient the level of a gene
product encoded by the same gene as in step (a)

wherein a decrease in the level of said gene product in step (c) relative
to step (a) identifies said test compound as an agent having therapeutic
activity.

18. The method of claim 17 wherein said therapeutic activity is
anticancer activity.

19. The method of claim 17 wherein said cancer is a member selected
from breast cancer, colon cancer, lung cancer, prostate cancer, ovarian
cancer, pancreatic cancer, cervical cancer and kidney cancer.

20. The method of claim 17 wherein said gene product is an RNA.

21. The method of claim 17 wherein said gene product is a
polypeptide.

22. The method of claim 21 wherein said determination of said
polypeptide is a determination of an enzyme activity.

23. The method of claim 17 wherein said gene of Table 1 is a gene that
encodes the same gene product as a polynucleotide of SEQ ID NO: 1 - 805
and 855 - 923.

24. The method of claim 17 wherein said molecule is a member
selected from an antisense DNA, an antisense RNA, a ribozyme and an
siRNA.

25. A method for identifying an antineoplastic agent, comprising:

(a) contacting a test compound with a cell that expresses a gene of
Table 1; and

(b) determining a change in gene expression as a result of said
contacting;

whereby said change in said gene expression identifies said test
compound as an antineoplastic agent.

26. The method of claim 25 wherein said change in expression is a
decrease in expression.

27. The method of claim 25 wherein said contacting occurs in vivo.

28. The method of claim 25 wherein said gene of Table 1 encodes the
same gene product as a polynucleotide of SEQ ID NO: 1 - 805 and 855 - 923.

29. The method of claim 25 wherein said molecule is a member
selected from an antisense DNA, an antisense RNA, a ribozyme, an siRNA, a
small organic molecule and an antibody.

30. A method for determining the cancerous status of a cell, comprising
determining elevated expression in said cell of a gene of Table 1 wherein
elevated expression of said gene indicates that said cell is cancerous.

31. The method of claim 30 wherein said elevated expression is an
elevated copy number of the gene.

32. The method of claim 30 wherein said gene of Table 1 encodes the
same gene product as a polynucleotide of SEQ ID NO: 1 - 805 and 855 - 923.

33. A method for identifying a compound as an anti-neoplastic agent,
comprising:

(a) contacting a test compound with a polypeptide encoded by a gene
of Table 1,

(b) determining a change in a biological activity of said polypeptide due
to said contacting,

wherein a change in activity identifies said test compound as an agent
having antineoplastic activity.

34. The method of claim 33 wherein said gene of Table 1 encodes the
same gene product as a polynucleotide of SEQ ID NO: 1 - 805 and 855 - 923.

35. The method of claim 33 wherein said change in biological activity is
a decrease in biological activity.

36. The method of claim 33 wherein said biological activity is an
enzyme activity.

37. The method of claim 36 wherein said enzyme is selected from
kinase, protease, peptidase, phosphodiesterase, phosphatase,
dehydrogenase, reductase, carboxylase, transferase, deacetylase and
polymerase.

38. The method of claim 37 wherein said kinase is a protein kinase.

39. The method of claim 37 wherein said kinase is a serine or
threonine kinase.

40. The method of claim 37 wherein said kinase is a receptor tyrosine
protein kinase.

41. The method of claim 37 wherein said protease is a serine protease,
cysteine protease or aspartic acid protease.

42. The method of claim 37 wherein said transferase is a
methyltransferase.

43. The method of claim 42 wherein said methyltransferase is a
cytidine methyltransferase or an adenine methyltransferase.

44. The method of claim 37 wherein said deacetylase is a histone
deacetylase.

45. The method of claim 37 wherein said carboxylase is a
γ-carboxylase.

46. The method of claim 37 wherein said peptidase is a zinc peptidase.

47. The method of claim 37 wherein said polymerase is a DNA
polymerase.

48. The method of claim 37 wherein said polymerase is a RNA
polymerase.

49. The method of claim 33 wherein said biological activity is a
membrane transport activity.

50. The method of claim 33 wherein said polypeptide is a cation
channel protein, an anion channel protein, a gated-ion channel protein or an
ABC transporter protein.

51. The method of claim 33 wherein said polypeptide is an integrin.

52. The method of claim 33 wherein said polypeptide is a Cytochrome
P450 enzyme.

53. The method of claim 33 wherein said polypeptide is a nuclear
hormone receptor.

54. The method of claim 33 wherein said biological activity is a receptor
activity.

55. The method of claim 33 wherein said receptor is a G-protein-
coupled receptor.

56. The method of claim 33 wherein said polypeptide is contained in a
cell.

57. The method of claim 33 wherein said molecule is a member
selected from antisense DNA, an antisense RNA, a ribozyme, an siRNA, a
small organic molecule and an antibody.

58. The method of claim 57 wherein said antibody is specific for a
polypeptide comprising an amino acid sequence of SEQ ID NO: 806 - 854.

59. A method for identifying an antineoplastic agent comprising
contacting a cancerous cell with a compound found to have anti-neoplastic
activity in the method of claim 59 under conditions promoting the growth of
said cell and detecting a change in the activity of said cancerous cell.

60. The method of claim 59 wherein said change in activity is a
decrease in the rate of replication of said cancerous cell.

61. The method of claim 59 wherein said change in activity is the death
of said cancerous cell.

62. A method for treating cancer comprising contacting a cancerous
cell with an agent first identified as having gene modulating activity using the
method of claim 25, 33, or 59 and in an amount effective to cause a reduction
in cancerous activity of said cell.

63. The method of claim 62 wherein said cancerous cell is contacted in
vivo.

64. The method of claim 62 wherein said reduction in cancerous
activity is a decrease in the rate of proliferation of said cancerous cell.

65. The method of claim 62 wherein said reduction in cancerous
activity is the death of said cancerous cell.

66. The method of claim 62 wherein said cancer is a member selected
from breast cancer, colon cancer, lung cancer, prostate cancer, ovarian
cancer, pancreatic cancer, cervical cancer and kidney cancer.

67. A method for treating cancer comprising contacting a cancerous
cell with an agent having affinity for an expression product of a gene of Table
1 and in an amount effective to cause a reduction in cancerous activity of said
cell.

68. The method of claim 67 wherein said expression product is a
polypeptide.

69. The method of claim 67 wherein said molecule is a member
selected from antisense DNA, an antisense RNA, a ribozyme, an siRNA, a
small organic molecule and an antibody.

70. The method of claim 69 wherein said antibody is specific for a
polypeptide comprising an amino acid sequence selected from SEQ ID NO:
806 - 854.

71. A method for monitoring the progress of cancer therapy in a patient
comprising monitoring in a patient undergoing cancer therapy the expression
of a gene of Table 1.

72. The method of claim 71 wherein said gene encodes the same gene
product as a polynucleotide of SEQ ID NO: 1 - 805 and 855 - 923.

73. The method of claim 71 wherein said cancer therapy is
chemotherapy.

74. The method of claim 71 wherein said cancer is a member selected
from breast cancer, colon cancer, lung cancer, prostate cancer, ovarian
cancer, pancreatic cancer, cervical cancer and kidney cancer.

75. A method for determining the likelihood of success of cancer
therapy in a patient, comprising monitoring in a patient undergoing cancer
therapy the expression of a gene of Table 1 wherein a decrease in said
expression prior to completion of said cancer therapy is indicative of a
likelihood of success of said cancer therapy.

76. The method of claim 75 wherein said gene encodes the same gene
product as a polynucleotide of SEQ ID NO: 1 - 805 and 855 - 923.

77. The method of claim 75 wherein said cancer therapy is
chemotherapy.

78. The method of claim 744 wherein said cancer is a member
selected from breast cancer, colon cancer, lung cancer, prostate cancer,
ovarian cancer, pancreatic cancer, cervical cancer and kidney cancer.

79. A method for producing test data with respect to the anti-neoplastic
activity of a compound comprising:

(a) identifying a test compound as having anti-neoplastic activity using
a method of claim 25;

(b) producing test data with respect to the anti-neoplastic activity of
said test compound sufficient to identify the chemical structure of said test
compound.

80. A method for producing test data with respect to the anti-neoplastic
activity of a compound comprising:

(a) identifying a test compound as having anti-neoplastic activity using
a method of claim 33;

(b) producing test data with respect to the anti-neoplastic activity of
said test compound sufficient to identify the chemical structure of said test
compound.

81. A method for determining the progress of a treatment for cancer in
a patient afflicted therewith, following commencement of a cancer treatment
on said patient, comprising:

(a) determining in said patient a change in expression of one or more
genes of Table 1; and

(b) determining a change in expression of said gene compared to
expression of said one or more determined genes prior to said cancer
treatment;

wherein said change in expression indicates progress of said treatment
thereby determining the progress of said treatment.

82. The method of claim 81 wherein said change in expression is a
decrease in expression and said decrease indicates success of said
treatment.

83. The method of claim 81 wherein said gene encodes the same gene
product as a polynucleotide of SEQ ID NO: 1 - 805 and 855 - 923.